Process Scheduling in Heterogeneous Multiprocessor Systems Using Task Duplication
نویسندگان
چکیده
Scheduling tasks in heterogeneous parallel and distributed computing environments continues to be a challenging problem. In this article, the authors investigate the Heterogeneous Earliest Finish Time (HEFT) algorithm, along with alternative scheduling policies for task prioritising phases and the Critical Path on a Processor (CPOP) for scheduling tasks on a heterogeneous multiprocessor system. The authors show that by combining the HEFT algorithm selection policy with the task duplication strategy, it is possible to further reduce the schedule length produced by both HEFT and CPOP. The process scheduling algorithm presented in this article compares favourably with other algorithms that use a similar strategy. The proposed algorithm has a time complexity of Ο(│V│2(p + d)), where V represents the number of tasks, p represents the number of processors and d the maximum in-degree of tasks. DOI: 10.4018/jbdcn.2010010104 International Journal of Business Data Communications and Networking, 6(1), 58-69, January-March 2010 59 Copyright © 2010, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. It is assumed that there is one entry task na and one exit task nw for the DAG. In both homogeneous and heterogeneous environments the process scheduling problem is NP-complete (Garey & Johnson, 1979; Graham, Lawler, Lenstra, & Kan, 1979; Kasahara & Narita, 1984; Ullman, 1975). In homogeneous environments several heuristics were proposed using both list scheduling without task duplication (Ahmad & Kwok, 1996; Wu & Gajski, 1990) and list scheduling using task duplication (Ahmad & Kwok, 1998; Bansal, Kumar & Singh, 2003; Chaudhuri & Elcock, 2005; Park & Choe, 2002). In list scheduling, tasks are assigned a priority and based on that priority a task is scheduled. In task duplication, some tasks are deliberately scheduled on more than one processor to reduce inter-processor communication. A collection of a diverse set of resources interconnected, in most cases, with a high speed network defines as a heterogeneous/grid environment. A similar list scheduling approach has also been taken in such an environment. The Heterogeneous Earliest Finish Time (HEFT) algorithm along with its alternative scheduling policies, and the Critical Path on a Processor (CPOP) algorithm use this approach (Topcuoglu, Hariri, & Wu, 2002). The HEFT algorithm and its variations, firstly assigns a weight to each node based on the average computational cost, and to each edge, based on the average communication cost. Secondly, they assign a rank, using a rank function which takes into account the assigned weights, to each node. Thirdly, the nodes are prioritized on the basis of their rank value. A task is then scheduled to a processor, which returns the earliest completion time. The CPOP algorithm, on the other hand, also assigns a weight to a task based on a rank function, but it first selects the critical path tasks, defined as those tasks with the highest, but equal, weights. The processor which minimizes the cumulative computation costs of the critical path tasks is then selected and identified as the critical-path processor. Only critical path tasks are scheduled on the critical-path processor while the other tasks are scheduled on any of the remaining processors which return the earliest completion time of the task. The Dynamic Critical Path Duplication (DCPD) algorithm (Liu, Li, Lai, & Wu, 2006), is a duplication-based algorithm which uses a dynamic list-based approach for scheduling in a heterogeneous/grid environment. The DCPD algorithm determines the critical path of the DAG and dynamically selects the next node for scheduling. At each scheduling step the DCPD algorithm identifies the next task for scheduling as the dominant task (Dtsk). To identify the Dtsk, the DCPD algorithm first identifies the ready—tasks where their immediate predecessors have already been scheduled. Each task na in the ready list is then assigned a value based on three components. The first component is b-level(na)— the length of the longest path from task na to the exit node. The second component is u-level(na)—the shortest average length from the entry node to na; and thirdly, the average computational cost of task na. The dominant task, Dtsk, is the task with the maximal value of b-level(na) added to the average computational cost of na minus u-level(na). The DCPD algorithm then identifies the dominant processor, Dpro, as the one which allows the Dtsk to complete the earliest, utilizing idle slots whenever possible during the identification process. The Dtsk is then scheduled on the Dpro. In this article, we have combined the idea of task duplication with the ranking concept used in the HEFT algorithm and developed an algorithm which outperforms the HEFT algorithm, the six (6) alternative policies for the task prioritizing phase of the HEFT algorithm, the CPOP algorithm and the duplication-based DCPD algorithm. The rest of the article is organized as follows. Section 2 introduces the basis of the algorithm. The proposed algorithm and its complexity are provided in Section 3. In section 4, we have presented an illustrative example. Section 5 provides experimental results and compares these results with the performances of the DCPD algorithm and the 10 more pages are available in the full version of this document, which may be purchased using the "Add to Cart" button on the product's webpage: www.igi-global.com/article/process-schedulingheterogeneous-multiprocessor-systems/40914?camid=4v1 This title is available in InfoSci-Journals, InfoSci-Journal Disciplines Communications and Social Science. Recommend this product to your librarian: www.igi-global.com/e-resources/libraryrecommendation/?id=2
منابع مشابه
An Efficient Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems Based on TRIZ
An efficient assignment and scheduling of tasks is one of the key elements in effective utilization of heterogeneous multiprocessor systems. The task scheduling problem has been proven to be NP-hard is the reason why we used meta-heuristic methods for finding a suboptimal schedule. In this paper we proposed a new approach using TRIZ (specially 40 inventive principles). The basic idea of thi...
متن کاملAn Efficient Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems Based on TRIZ
An efficient assignment and scheduling of tasks is one of the key elements in effective utilization of heterogeneous multiprocessor systems. The task scheduling problem has been proven to be NP-hard is the reason why we used meta-heuristic methods for finding a suboptimal schedule. In this paper we proposed a new approach using TRIZ (specially 40 inventive principles). The basic idea of thi...
متن کاملPre-scheduling and Scheduling of Task Graph on Homogeneous Multiprocessor Systems
Task graph scheduling is a multi-objective optimization and NP-hard problem. In this paper a new algorithm on homogeneous multiprocessors systems is proposed. Basically, scheduling algorithms are targeted to balance the two parameters of time and energy consumption. These two parameters are up to a certain limit in contrast with each other and improvement of one causes reduction in the othe...
متن کاملPre-scheduling and Scheduling of Task Graph on Homogeneous Multiprocessor Systems
Task graph scheduling is a multi-objective optimization and NP-hard problem. In this paper a new algorithm on homogeneous multiprocessors systems is proposed. Basically, scheduling algorithms are targeted to balance the two parameters of time and energy consumption. These two parameters are up to a certain limit in contrast with each other and improvement of one causes reduction in the othe...
متن کاملModified Genetic Algorithm Approach to Optimize Task Scheduling on Heterogeneous Multiprocessor Parallel System using Node duplication
Task graph scheduling is the important factor which occurs in the multiprocessor system. There is problem solving technique called NP complete which is the optimal scheduling of parallel tasks with some precedence relationship onto parallel machines and can be solved only by using heuristic approach. The execution time requirements of the applications tasks are assumed to be stochastic. Genetic...
متن کاملGreen Energy-aware task scheduling using the DVFS technique in Cloud Computing
Nowdays, energy consumption as a critical issue in distributed computing systems with high performance has become so green computing tries to energy consumption, carbon footprint and CO2 emissions in high performance computing systems (HPCs) such as clusters, Grid and Cloud that a large number of parallel. Reducing energy consumption for high end computing can bring various benefits such as red...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJBDCN
دوره 6 شماره
صفحات -
تاریخ انتشار 2010